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ABSTRACT 
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19 

The embodiments of the invention in which an exclusive property 
or privilege is claimed are defined as follows: 

1. An isolated genomic DNA molecule comprising the nucleotide 
sequence of Figure 2 (Sequence ID No.:l). 

2. An isolated DNA molecule having the nucleotide sequence of ■ 
nucleotides 2033 to 13636 of Figure 2 

(Sequence ID No.:20). 

3. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 109 to 1764 of Figure 2 (Sequence ID No.:21). 



4. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 2216 to 3937 of Figure 2 

(Sequence ID No. :22). 

5. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 3940 to 5481 of Figure 2 

(Sequence ID No. :23). 

6. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 5654 to 6595 of Figure 2 

(Sequence ID No. :24). 

7. An isolated DNA molecule having the nucleotide sequence of 
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(^uenceIDNo,:25). 

8. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 7895 to 9076 of Figure 2 

(Sequence ID No. :26). 

9. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 9241 to 10908 of Figure 2 

(Sequence ID No. :27). 

10. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 10998 to 12296 of Figure 2 

(Sequence ID No. :28). 

11. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 12622 to 13365 of Figure 2 

(Sequence ID No. :29). 

12. An isolated DNA molecule having the nucleotide sequence of 
nucleotides 13769 to 14995 of Figure 2 

(Sequence ID No, :30). 

13. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 10. 

14. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 11. 

15. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 12. 

16. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 13. 

17. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 14, 

21 

18. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 15. 

19. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 16. 

20. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 17. 

21. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 18. 

. 22. An isolated DNA molecule comprising a nucleotide sequence 
encoding the amino acid sequence of Figure 19. 
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ITaii isolated protein having the amino acid sequence of Figure 
10. 

24. An isolated protein having the amino acid sequence of Figure 
11. 

25. An isolated protein having the amino acid sequence of Figure 
12. . 

26. An isolated protein having the amino acid sequence of Figure 
13. 

27. An isolated protein having the amino acid sequence of Figure 
14. 

28. An isolated protein having the amino acid sequence of Figure 
15. 

22 

29. An isolated protein having the amino acid sequence of Figure 
16. 

30. An isolated protein having the amino acid sequence of Figure 
17. 

31. An isolated protein having the amino acid sequence of Figure 
18. 

32. An isolated protein having the amino acid sequence of Figure 
19. 

33. A recombinant vector comprising a DNA molecule in 
accordance with any of claims 1 to 22, 

34. A host transformed with a recombinant vector comprising a 
DNA molecule in accordance with any of claims 1 to 22. 

35. A host transformed with a recombinant vector in accordance 
with claim 2 wherein the host is a Streptomycete. 

36. A host in accordance with claim 35 which is S. lividans. 

37. A process for producing clavulanic acid in a non-clavulanate- 
producing host comprising transforming the host v^th a DNA 
molecule in accordance with claim 2 and culturing the host under 
suitable conditions to produce clavulanic acid. 

38. A process for producing clavulanic acid in accordance with 
claim 37 wherein the host is S. lividans. 

' 39. A process for enhancing clavulanic acid production in a 
clavulanate-producing host comprising 
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23 transforming the host with a DNA molecule comprising a 
nucleotide sequence encoding one or more of the enzymes of the 
clavulanate synthetic pathway. 
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ttI03YWTKBSJ S 

Tlll.Ji Invention relnteEf to m<?thod.a for the production 
s Qt the antibiotic, cldvulemtc acid. 

BflrKqro""^ of the_lnyqntlan 

clavulaniu wjlti is a broad spectrum b«ta*).actAifiasc 
inhibitor and i* an Important mntibtotic for the 

10 treatffleT^t of infect ic»^>tf dLse^ees. It is produced 

cQTmnercxally by the cfrawi-poeitlve mycelial prokaryotO 
gj^ yj^j^tecmy^ftg clavuLiagg ua. wMcb also produC^iJ the i?- 
laotAXa antibiotics pcnion.^Ln de^nC^OtOXy 
ceplialOBphoriJl C An(t cephamycin Until recently, 

13 hov-ttver, th« pathv^ay eraploywjl Cor clavulanic aotd 
biosyntbesio waa much lose well understood thar tHa 
pathways leadirtg to these othar ant iuLo tics. 

Without fcnovilftdqre of the p^itliway Cor clavul^inlo acid 
biosyntheelflj it was not posftlble. to l&olato tiie. genee 

20 coding for tfid Koy ensiyniea and to memlp-ulate thesa Cfcne* 
to inorod^ct antibiotic yiald or perjplt poroduction of the 
antibiotl.c in heterolu^ouA jiyatema. 

One of th» OBirliftSst enaymes Of tll« pathway to be 
purified and cbaractBrieedwas oletvaminic acid synthRsa.. 

25 Two i^o^yiBies hava nww bc^n tdentifisd and eliaraeterisad 

(Marsh Bt al., £1992), BlocKsin., vol., 31, pp. 136^8-<57) . 

Suropeajl ps^tent Application C^^ai^i describes a PWA 
raatriction fragment encodLng a portion of th* genetic 
informatioii involved in clavulAhiC! ttcid ayntbeaifi but 

3 0 provides no sequerce infortnntLon. 

Untjt L the vork of the present invsntur$, tbe 
complete cortE»lom«rxt of genea re<|uiroa for clavulanlo aold 
synthesis had not been idantifiCfU The prag*>tlt inventor a 
have nm< iaolatsdf cAor^fid and aequBftCdd an 13. (» fcb 

3 3 genomic OKA sequence frojrt fi. n 1 ftv ul.ioregu a -rfhiCsh codes for 
slgtit proteins and sn^blcs th,9 production Ot cXavulanic 
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acid by bjrunaf omanbii of aon-cXavu]anic^prDdaCfin<f 



Btary of the Invention 
An is Dinted gunoiiilo UNA molecule is piovi^od 
cojnprlsitjg th() miolcotldc sequence eet out in rlqurs 2. 
A procftf-p: tps provided for producing oLavnlsTiic acid In « 
transforroant of a noxi-ul&vulanat.e -producing host. 



10 DASCfintton of liravinoa 

1*113 invention, as exemplified by a proTerred 
Gffibodii&cnt, lit d«scrih«d vith rsfit^oncA to the 
aooompanviog drawings In «/hicfh; 

Figure 1 shows the terminal amino 'Acid peqiience of 

13 and bJlC nucieohide sequsnce of a |;^robe (sequenco IE> 

Mo, J 2) directed to the unJoriJrK«3 reqion of th4 poqti^ca. 

Figure 2 (2-1 to Z-^io) shows tho nuoLeotlde flaquencd 
(S«qu«no* ID Ko. :l) of a 15 KJa gcporaio OUA fragment froro 
S. alavnjliaerua . The ncqucnoeB of the ten OHPs vitl^ln 

ZO the fragment ftru £hovn in upper case l«ttcr£: the 

int^i^dnic rogiopia shown in lower oaee letters. Tlio 
locations of the beginning and end of each OR? ar» also 
Indicated directly abov« t^xfi nucleotide «e^u&no«, 
AatatiE(Ks ftbove the sequenca indicate th« Eoo Rt aibes 

29 Which narK tine |»eq inning and end of th« portloTi of thd 
DNA sequence which contains all the genetic information 
for olavulanic &cld ayntheais. 

Flqure 3 shows the location of the open reacting 
frames downstreAn from pcbc . 

30 Figur* 4 ahowB a gwirtial restrlaniou map of the BKA 
sequence of Figure 2 in the region surround ing .tfla - 
(CRM) . 

Pigura 5 shows a shuttle vector used *or disruption 
of th« ola 9'&Yie. 
3 3 Figure 6 ehuws a photograph of an «gar p'J*te bearing 

culture* Ot- S- lividana trnntfformftpta. 
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Figure 7 sho>i'« Rn AHqnmewt of the OTalno acid 
s<3qd«n0O oC OlJ\ r n> clavul iner-o.^ CTtA) v/ith tbcac OJT Ej_ 
Cft^ t Jigmatine ureohydrolctso f H. Coll AUH) , yeast 
arg.inafle (yeast ARC] , rat, avgiiiaaa (r<it ARfi) and human 
& arginaae (hutntiTi ARO) . 

Figvre « shows o Jioctthern of Hcol di^*?rta of 

g'enoiaj.c nuA firom fiv«i prrt^utnptive raubantis (lanes 1—5} and 
fram wiW— Cypc S. cl^^/ulioQrua ( lAno (5^. Panel A ; 
meittbrsiPeB pL-obed with cld-3peci.tic prolan, Pawe.1 B s 
LU meiBlcxonea prubed v;lt:h ter-epecsifiu jt*rctoe. 

Pi<yurc 9 shoxra reatri-cfci<jfi cn7'y(ue laaps of fij^ 
KL avuliaftruE DHA inserts in cosmlda* A. R^strlc-ttoii 
«n7.y«ia map of co^mld K(SL^ - B. C-autxal roatriction 
ancytne »ap of <!oaT«td KaL2. C, Roatriction jjiaJ^ Of 
13 cofimids K&L2 and KSLS Indlc^^Lnq loc&tion ot pcbG gerii» in 
relation to prJLa. T+ie 2.0 Itb Mcol CYa^men-t 

oncowpos&ing the cla ^ene u«©<^ In generating kid&trod 
ddlftClonQ for aequencing. Abbreviationas fiainHX; 
B,aalllf ^tfiS^Vt K. Kpn If N, HtfQl; S,fiflJ.T; and Sjn,6giat,I, - 
20 Pigur* 10 showa the doOiuoed awino acid s^tqpience 

(Sequence ID tro.;3> or onn of Pioiirc 2, 

Fl9ure ia showa ths d^duoe^ atnlno acia ^fcfquenca 
(Sequence XO rto, !4) of 0RF2 of figure 2. 

figure IZ Shows the deduced amino- acid sequence 
2? (HJSequenca ID »o.:3} ot 0«K3 of Figurd 2, 

Figure 13 above the deduced amino acid saqa«nc« 
(Sequence lO Mo. :6) uf 0RF4 ot Figure 2. 

f'i9ucB 1'4 bilious the deduced t^inino ftcid aequanco 
(8«qu«nce id Ko. :7j of ORP5- of Figure 3. 
30 Figure 15 show^f the deduced anino acid sequence 

(Sequence ID HO,; 8) of (}Af6 of Figuro 2* 

Figure :i5 ahovs the deduced awino acid lisoqiten^e 
(Sequence ID Ko.:9) of ORKV of Figure 2, 

Figure 17 ahows th« doduced amino acid sequence 
35 (Boquencs ID Wo. :10) of ORf'U of Figure 2, 

Pigurt) 19 sUova the deduced Amino acid aequenco 
(aequ^nco in no. ! ii> of ORF9 or Figure 2. ' 
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rt,<9rnrft 19 ahowa the aeduc€»d aftlno acjd sequence 
(Sequence iD Ho.: 12} of ORFiO Of Ficfura J* 



DotAlleti (lL"^arir>tion nf the Invention 
5 pro^uct-ion of penicillin an<J ccphamycin Antibiotics 

in Si_slav.y4-igarJl£ rst-Art-': vtth the convetsflon Qt*. lyaina 
to Or-flmJno^d.iplo acid (>I-:nilcl«i-i ft I,./ (1969), J. 
Bacterid., v. 171, pp, 293-302; (3991), J. Dacteriol. , 
V. 173, pp. 9as-98n) . «-JMnlnoadipit; ftold th&n condenses 

iO with oyf.tfti.ne and valine to givo {-(Ti-'«-a(»inoDdiE>yl)-L* 
cyateinyl-D -valine (ACVJ by the action of Aainoarflpyl- 
cysteinyl-valino synthetase (ACVS) . ACV 33 converted lOY 
isop«niQi.l 1 In u Bynthne© <IP}?J=:> to iaopenicillici and, 
through n serisfr of rcftfttlons, to de&cic;«toxycophalosporin 

15 C and uHilmatfiiy to cephamycin C tJeiJi*cn et al., (ISS'V), 
AppL. lUcro>i).oL. liiotechnol, , v, 20, pp JiiS-lSO) , 

The ACVa of g. olaV ullffc^YU s has been puctflcd 
partially uli^irACtorizcd by three aepeirtito groupa, and 
estimates of Its tnolecular veli^ht; vary tram 3 SO, 000 to 

20 500,500 Da (Jensen et (I^SU) J. BactorJol,, V. 173^ 

pp. 7269-7271? SobxjfecKe et al,, (1992); Bur* J, Biochom., 
V, 205^ pp- 6a7~6^A; ahang and D<suiain,r (i«90}, Biotocti 
Lett., v« la, pp. 6^9-S54). During thair purification^ 
Jsnaao et al. obeorved r. J2,oOQ Da protein wiiloh co- 

23 putifi4d with ACV? rSeapite proCddurdS wKicti ahould remove 
sntall molecular t/eight conipon&nta. It hns noU' bftcn found 
that this pt&tein Is wat related to ACVS but rather to 
Cl<iyaleit)il.O add biosynthesis;. It has been designated 
CIA. 

30 In accot dance with one enibodiment Of the invention, 

the prGS&nC inventors havs idantif i€icl, cloned and 
S«qUftnced tha gene (cla) encoding tins protain. 

In accordanuo W.lth a farther ajnbodlWLcnt of the 
invention, tUa Inventors h«VO cloned and soqucncod a 13 

>5 kb stretch of genomic QUA from S. q ^avuna<!rua which 

includes tho oia. gene. Within thia 15 kb sequancs, th« 
invant<?C3 hav« identified an 11,6 kb PWA fragment Which, 
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v/hon lntradLLC«»it ijit<;> t)i^ r!on-<.^tftVuifln«,tf^ pvoduc^v 9^ 
llvtdanf; Afj ao^JcrLttcd In tOxatuplo 4, enabled that specie:* 
to p-roduca clavulanio acid. U'hie Indicate)^ ttmZ tn« 
kb frngtaent containsi all Uho tyomittc lurormatton repguircd 
s for olavulanAtft product. i on. 

As vill be unfleretaod by thoss ssKilloa in the art^ 
ttiia identification of tha ONA sequence encodinq tha 
ftnaymcif requitod rot ciavalanalift synthesiB will perjnit 
qrenetic manipulationB to mo-dify or ontianco tfi4VHi«nftte 

10 production. For example, <jiXhVUlAnato production by 

ol Avaliaftyufl \wy modif ied by Introduction ol cittr* 
copiaa of th» gene ox' genrj:^ Ifor Cfttc Itmiting^ enzymes or 
by altoraClon or tl\c requl.atory componeota controlling 
enpreBeion the genes tar the clavulAnftUc p*tlway« 

13 KaterolagouB ocgjni&ms 'S^liicH do not norjn^lly 

plroduc© ciavvKinatc may als:o fc>o enabled to proditca 
clftvulajxate by introduction, for example, of tllo 11,6 kb 
Df/A ssqu&ncs of the invention by tecixniquea \7UlcK are 
w^kl Knovn in tJie art, a^: ^ae^plified herain by tho 

20 production ot S. Mvldana a trains capable of Clavulanate 
eviatheais* Such heteroloyouH production or ciavulanic 
acid provides zt ma&na of producing clavul/inic acid fr&e 
cr ochfrr oontam-lJiating cXavams which are x^roUaciod by fij. 
c X a vu 1 icf eru a > 

25 Suitable va.ctors and ho«;t? vIlX be known to tho^« 

sHilLed in tlie art; suitable vectors include pIJ702« • 
pJOSfil^ and pIJ{i22 and suitable nost» include s. 
lividans . fi,_feary.UlZWSr S- crlflgofTil vtia . 3. antlbiotl gUfl 

and g, ItpmanlU 

^0 Additionally, the DMA sBqu&noo^ ot the Invention 

enable tlio production ot one or more of the etitya&if of 
the clavulanate pathway by exprossion of th« relevant 
gene or ganes in a Jiotoroioqous expreeaion Bystem. 

The DWA s<iqiie.n«aa coding for one or rtor*» of the 

3 5 pathway enzyzjea may be introduced into suitable vectors 
and hosts by conventional tccbniq[\ies knovn to those 
akilLed in the art. Su I tables vftctore include pUCli&/L19 
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and pE?r-n aivl sultabLs tiosts inclwda mary oxgani^ms^ 
incluaing B...^oU strains siicta aa and iiL2l[DE7) * 

An Ol.iqoixuc Leo tide prob« tMiSiCCI on tha K-torTBtnal 
amino acid ssq^u«nt;v» oC Cl.Jv vas constructed as shown in 
6 Figure 1 and was usfta to iBolate ttio ^ftre coding fox Wie 
profcoin from e. clavuliaa rus. as der.cribed in Bxftinple 1. 

The qerts. vaa found tO bc located in ttlC £j. 
nlavuliqer Ma chromosoiats about 5.7 KD dowatrenm of pcbC, 

tha gono which encodea isoponicillin H aynthfii^. The ,*. • _ ^ 

10 genH containe a 933 top opRii raading frame ((>RF) * encortin^ 
a pro t din of moincnilar weight 13f3*;«, The decluced Awino 

acid se(|UCnoe '-/as conpnred to dat^hAse eequ9nt:<»S ftp4 - . 

£ftiDwe<l greatsBt Biinilariuy to ftnxymea associated with •.. -'• * 

argliiine inGtaboLisfA, pcthbly agnatind^ urcobydrolaae cin^ 
lit arginasoiS, \i 
When an intamAi Cragment of ttxs Cia gene waa 

label lad and ws&d to probe re:»trlr'tlQn endanuoloa^e 

digasts of genomic DHA from a variety of Oth^r 

RtreptowvceB a/»d related speciea, ovld«n(ie of hoi»olO<(OUj5 

sequences wasf &een only in other clavulanic acid of 

clavaa nctaboUte producers, incLudinq gtre pfcoa^gCfig 

jmY.Qn;iLneftfl£e . Stce^ oKyccr. lipTnaoii (7) and fitreptowtv caa 

flnbibiQt.icu 5. Ho ccoas reactivity vac seen to the ^- 

l&ctaia producing apeciea Njififljalia lactamdu m/is . 
35 Streptomvcea qris eus or Streprtoinvcss cattlGva. nor to dny 

Ot a variety &f other fitreptowvcBa tfp«cic3 T*hlch do dot 

tilroduco ^-lactam compounds , including s. fra dlaa ATCC 

1&60?, 5. venaeu elim 135 and 3, arifla ogulvii*; NBRl/ 6-5429. 
DisruptLdn oC the da . gene, ae described in EDfaeaple 
50 led to Lnea of the ability to ayntheaisa clavailanlc 

aold, 

A 15 hb DKA &equei)ca extending dovnatream from jgt^SQ 
waa Cloned (vnd eeguenoeU deacribed in BxampLo 5. The 
mioleotidB sequence Is ahovn in Figure 2, Khan this 
3-5 sequence Intormation w»b analysed for percent G 4- c aa a 
£\mfitlon of codon pOf:lr.ion (Bibb at al, , , Geno, v, 

30, pp. 157-150), tem complet& ORfS vere evidsntj as 




^^•y \": " ..." V 
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lUQUrtL* 

ai&l CACGCCQTCG CCCQ&QCCCT 

CTCCACCQTifi CTGATUCGCL UtiCAGTACCC 
ttCTCUGTf^e tCCCCCDC{>Q AOCCC0GC7T 
0341 tGACACCCCC CCCAAfiCAGQ TCCCGBTLAG 
8401 CAAGGGCBTC ttftATCCTCG /iCCCCCACAT 
B^fl> CGCCCEGCTG GACCCCiGCCG AQCAGGACCG 
852 ^ CAACGCCG7C ACCAKnArA CCfiACACCfC 
6^61 CCGCCraCCG GCCGAt-GTCC AC^CCGfiGGA 
d^^l GeCUTtiCrC AA^GACATCG CCAt^CCACGG 
6701 &GTCACCGGC CCCCGi:aACQ ACGCCCACGC 
B761 CCC(^rtG&re AAC:ACCfiCCG ICCf^CCCCTC 
B62I CArCQGCfcAQ rWTCGGAM ADACGGAtAT 
868 \ L^AGGTCGAG GTCfATCCQC CQAAOGCCCIi 
8941 CQCCarCGCO QAQCATCfGC (CSGttiACGA 

9001 QQACGQG6CC TTCACCGtCT ACS6CTCCGA 

EM of aJtF 6--^ 
9061 &QA6TACACC ACClTfiAtccc c^gaea^gQa 

d121 cfio^^cc9t9 tg^ttalQcc gaccgttccc 

9181 acQ^^^cCiCQ gc^cqgce^c- ercgal^fag^ 

§»al.nnlng of DBF 7'-» 
0241 ArcGAGACCA CTCCC7CGAC GACC&COQAC 

9301 qrCGPaaCA CCGAtCCCCC CftGCGfit^ACG 

9361 TCCCTCQACC CCCGCAACAC GfACTACGCC 

CGSACGCrCB' TCACCTTCCA CACC&CACCG 

W\ CTCGCCGAGT CCCFGCCCQA GICCTCCaACl 

9£<f( GASGGCCrGC GCTACGiK^GA CaCCACSCCG 

9eOt QCCCGCA&CA ACTACGGCAC CCATGTCCCt^ 

d^l CTC6GCACCG AGTACCGlCGA CCCCrGGCCfi 

9721 WBACCCCffC AceiGCGftAC GCTCCTCTTC 

&7ai CTGCTGGCCA CCATOCC&TtT CACCACCCCt 



^ - ti 

C:(IGG1;TGCCG CAGfiGCOAQA TGC1GATCCC 8220 
GATGCACACC ATCCGGCAtlC AlltKAAGAC 4280 
CaACCCtGCG GCCCCCGCCA TCATWCGAC 83*0 
CaTtBGtGfifr OCCACCncc fGUCtATCCit 8^(00 
qeCGACIiCTG CTBAtCnCT TCaCCACBGA 8460 
CCTCnrCGC CGBarcATGC ACCCCMCTT 8620 
cAtCAGCfrAC WCSC&GTGC IGfTCGCCAA &6e<l 
CTrCCA(;CAG GCGCTSCACA CGCCCCCGCT afi4[> 
CaASQGCGCG UtCAAGCTCA TCRAGGtCCA 8700 
CAAGCGCGrC CGCAAGACW iCGfCAJiDTC 8780 
CCACCCrAAC TGeGGCC£G6 TC^CCATCaC 8^20 
CGACCAGOAG CGQGT«AC-GA TCCGCTTCCq Sdd^. 
C^qiGCqLACCAG GtCftACCACG CGC7CCSGGC d&40 
GCT^CrCATC G6GATCCACC ICGCCATCQC fl030 
CCTCACCGAff G3C7A7SrCC DGCTGAAtTC ^080 
accfggoc9cc pccccgtLte etQlcc»c-tc 9120 
cggc-talQC^ coc^ggocfg ngc/g^ccccc 9160 
9C9at9cci09 ^^qacgos^Q 009309990^ 9240 
CACGGCTTCG ACGCL-ftCGCT ACCGGCAGrG 9309 

C7liCC«CrGG 7CCtCAeG6A CGACTICGAC 9380 
TACACCTCGA AC77CCTCCG GCTCATWQC 9<!20 
GCCAAGGCCG GCCAGCCGCT CGTGCCCCIAC 9430 
GACCCCCGM TCTCffAiCrA CCGGCTGCGC $5*10 
GTCG7CTCfiG CCGACATCAA GCACGCCATC S803 

G<icc<:ccGrc cgacctaCTT ccccCACCfC 9660 

GAGCCGCACC CCCACaGACC G4TCACGC7G 9720 
CBGCTGCGGC AGCCGT7Cf>C CCGBArGGAT 9750 
GTfiCCfrCttCC ACCCGGACAC CGCCfiCCGAC 9840 
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ViGiim 2 - V 

TACCGBCrCC GACCSCCCCB fACCCGATC^^ TCTCBfACAC CCGCCGCaAH MCO 

9901 CTGCCCGTCC fCGACCCCAA TCCGCAC7CC CACCCCCAGA DCGACCCQaT GCGCOTCCAG 9960 

9«ei CGCBCCICCC CQATCGAG^T HCACnCCGC AAGSALXClX ACGAGGTGGA CCNTATQCTO 1C020 

10021 CTCGCCMCO ARGCCCATBT GWCCTCfiW G4CnC«T5 TBCASCCCGi CGCCCAGGAB lOOW 

lOOai CGUrCCTCG CCGAGCCGGA GCTGCCCCCB CitCGCGGACA ACCCGC7CAC CCCCnCACL 10140 

lOMt TCCATCTIXCT GCCTGTCCAft CCMArCGCC CCCTTCGACA A767WACTG CCGGCG«GCC 10200 

10201 nrGCAGTTCG CCACCt^ACAA fcGCCCCCATG CAtsGACGCGT ACGCCGGCCC CGTC^iGCCCC )02SO 

10261 6ACA7CBCCA CCACCCTGCf GCCCCCBACC CTCOACCCCr ACAACCACH CrxACCeCFAC 10320 

10321 CCflQtCQGCC CCfiACGfiCAC CCGCCACCW GWiGCC^CiX GCCCCCACCt GAAGCrGGCC 10380 

fOaei fiGGArCCCCC ACGGCTTCCG CACCABGATC CCCGCCCCCA ACBAtCGCCT CAAnSABTaC )o*\fio 

}044t CdG^CCGCCG A^GCGCKQC CGCCCGGCTC GCD:G«GTCG GCATCGAGBC ^Gi^GGTCCTti 10500 

10E01 GACnCCCGT CGGGCBACTiV CTTCGACCQC TACftGHG&Cf GCCCIlGACTh tcrGllfeCGAG 10630 

CACeWATCG frGATCATCAT G7tCGGCTG4 GftCGCCBAtT TCCCCftAtGC AFACGGCTTC tOaSO 

106;21 CTtCAftCAQA TChCCCACGG GCGCGDGATD AAGGAGCU4 ^CAACCAGAA CATGftGCGAQ t(KBO 

IQ&8) CTQOACGACC C6GACATCAA CQCGCrGCTG CACGAGGGGG CCCAGTGCGC CGACCC6QCB 107^0 

10741 CfiCCCCQCOQ AGATCTGCCA CCGCATCGAC CAQCTCAGGA tGdACCACGC GSrCAfCGTI tOa01> 

loaol ctauTcr&t acccgccgtc cctcctctac kgcacccgg acacccgcaa ccccTrccrc loseo 

end of dRP 7"* 

10S61 ACCG^CrCCT ICfiGffAKHA C&ACTACGTS GCGCTCCGCG CGAAGtGA^c oc999atcC9 lOdSO 

10921 Qccccgggac cgUtQleec q^ccq^Qc cccgcccgU ccccgcccQy tcc^^tecgg 10BS0 

109BI occCQgte^o s9COcgcTCA CCCGGACATC CGGGCCCCGG CCGCQACCCC GCftCC^OATC 11040 

llOdr GGCCAQTGGC CCrCCGCCA(» «GGCCintCC ACGCTGCCCC A^CUGAGC GCCC70GC6G 11 100 

(1(01 AACTCCOCCr CCTACAGCGt GAfiCTGCCftC MGAACTGCC 66(STCC6GCC GCrUG^CTG 1 11€0 

11191 GTCCCCCGCG GGCTGCQCAB DAGaCCCBG G&GCC&AnQS ACfGCrCCAG CaSBtGAArC M2S0 

11221 CGQCGGQTGA CCGCCGACTQ BCTGATtGAC /^iSCACCGCCC CGBCCCGGTT GATGCTGCCG IISBO 

UZai TGCCfiQttCCA CGSCCTfiGAG CAGATCCA6A TCftTCCACAT CCAGTrfGCG fiCCCfCGSCC 113^0 

TGQCCCGGCA CGCAQCCCTG CTCCGGTCCC GCCCCQAAGC GGCGGGCCTC CQCGCCGQTG 1M00 

IHQt tGCTCCCCGT ACCAC7GCCC CCACCAQCGC TCGICCACCA GG7CGCG£TB GTC77rGGCG IllW 

1148) AAGCGCCG6A GCTCGACCTC GGCGAtCABC GCGGCCAGCC QTCCCCCCAG CQrCCGCG«C I 1&20 
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USai CCCACCAGCG 

\W\ CC&tGGCGCJt 

11701 tiTGA(^CCCC:fi 

11791 aACACCCA&<l 

11821 T6Cr£CCl3G« 

1(881 CQCTCCWGC 

MSfltf CCCGCCJkCTt 

12031 AGC&AQ(iCCG 

I20a\ (idcACCTCCG 

12)81 AGCCSGftCCG 

122^1 TCGAGrCTQC 

12801 tc-teeqcoq-t 

12381 9tc99ttct9 

12^21 attgtcc9aa 

12541 -tcce^QQaftc 

1260 1 gacetg^te^ 

isd8) DQrcQccne 

12721 UGCTTGCGG 

12781 BtCGCCCCCC 

)^9<I1 CTCQQTtaACC 

129D1 GTA^ACGGCC 
GCCCTrGCTC 
ATT^SrCTCG 

130ai CAT«ArCCCfl 

13 till GACGCC^ICCG 



GG7CGACCA^ 
AGCGGTCC7* 
GGAKCCCCIT 
CCAGCrCCAA 
TrTcsrcccG 
frCAG&ATCAC 
rGTGCCGGGG 
GGGCGGCGAC 
CCGCtTCLTC 
GCGCfiTCCftC 
GKTGGA&GAA 

TSTMCGCCT 
G-CTCCACWr 
gteecaccgc 

m«(C£iCGG 
A7CT0OCTQA 
ftftCTCCGTQT 
TCCTGGCGCA 

GcorrGCGW 
cgcagca«ai 
ArcArccacB 

GCCrTCTTGA 

iccAccctcr 



210S113 

CA<}ACrGG7G GrfrCfiGCCCt CGCGLTCCGC CAG6GAGCGG M5$0 

CACCCCCGfiG 7GCG7aG<LC:c AGCCGAGACC 7A7C.GCGyCC t i3qo 

IfGUACDGAT fiCCCCCGT&A r(^7GCACCCG GG7GGGDGDG M70O 

GACACCrCrC ACCAQ^AICT CtCAGCttCC 7CCCC7C7C(* \17M 

ZAUATCdQCQ AGCGA&ACC7 CCCCCCCl^GC ffGCCAGCGGA 11 d2l> 

CCACAf^CCCd rCGTCtAGCA CCrCACAGG7 CC GC AC G«AC 11860 

GGACrCGAGtf CTCCAGC7C7 AfiCCCGCGTC CACClirGtAG n9>*0 

CrCfiTCC^G^ GCCrCGfOCC QriACr.GACAfi CAGC/tGG7CC 12000 

ChCCACCrCG TLGA^CAGGd G7rCGC7r»GA GACCAGCGAC ISQ«0 

GGCnrGGAG CCATUGCtGA AGjUATSWT CCPC^JCEGCC )&120 

CCCCCGCCtG CCCACCAGCA TOCGGGAGCe CGCCG7GG'rC 121^0 

CCUCAGGGTC AGCGGBAGfcC CgAUATCCC TJCCAOCCGG 12240 

*- -beginning ORF d 

CCCGrCGCGG ACACCCGTCC CCCGGGCCAC TTCCATs^gq 12300 

gteeo^taoA ^^caqatcqc atcggctgoc accq9cagce 123Q0 

caatgtcggt tcccttttcc gtcociggacl. ' gtaccQctija 12420 

oq^igcttcg gdat^^atcc toggCQgcgc cgcte^Lcgg 12460 

gaacgcgccc ggccggotgg cgggcgcg-ct cc^ggegccg 12540 

gcffcgfjcocg gccggccocc cggiccgggc gcgcggcg'tg 12600 

*"»nd or CRF 9 

arCAGACCFG «7CQGTGGGG CCTATGMGA TC7CGrcCAC 12860 

7CACGGCGTA GCCeACCQCC 7CCaCGA7GT CtTCGfiCC7n 12720 

7CCG€TCCrC CTACATCTCC TrtGTGGCGC TB7MGTGAT 12790 

CG4TGGrCCC CGCCTCCATG ACCACCACCC OCACCPCGCG 126q0 

GC6TC7CCC7 GAAtOC07rC ACACC«AACr TCGTGQCC7C 129Q0 

CG17CACCCG GCCCGLGATC GAGGA^ATCT GDACCACGfi7 12990 

CGGGMCCGC CI^CCCQRaTC ATSTACAfCA GGCCCA-QGAG 13020 

TCtAGTCQGr GGTGTCGGCG rCC7CCACCG GGCCCACCAC 130B0 

CGAGGA7C7C '^AOGCCSCCC AGCffCcrc^A COGTGGAGGC m40 

GCCCCrCGGC (JACCrCGACr TCGAGGACAf WACC7rCGC 13200 




.■■y■y;^:i^^i^.v.•,: 



• ' ■ • * . • •* • • 
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fasQi ccranciiGCG crcAiscTCGi cacccaq^c gcuuqcttc TCCACcreciC ccccggcgat 13Z9Q 

)92Bt COCCAC&CCG r;rGCCC fCGf CG«CCiX&C;/iC: GCGCGCCGTG GCCTCGCCCA TCCCC^AGCr 13320 
1:^21 caCGCCCGTG ATGiLGCGCGA CTTTCCtCTG GACTQCGGAT QQ^ArcaUl ccUxacatg ISOBO 
gtgctg^got egi99t9a9c qlatgoogoa gQg9t909QC clytrc^tQcc ^g^QCQggt^ I94'<0 

13441 ccgtQcqcco gaccgtt^cQ ^tg^gcoc^; rcgucc^g^t ccqgotqcjcc ccqgtUccc )3iO0 

gqsgogltcc cggQgao^gg tt^aolaccgt: g^CQctctcc gaCggtcttc ^goggococc I^G&O 

136d1 cg^ggottca cc 99904^^4 gcg^cc^ga^ tictccccgi ccocggcoga cgctalcagc 1^820 ^ 

13821 9^C9Caltcc c«Qi}^9aQ*>i ectUcq^t? gosc^gg* Id (QQC^i-t^c cqccgggtto 13SS0 

13681 tgcfjcgcrgc ece-99Pg9ac cgcjccocccg cccQQggocI gcggcociatf y^^c^ccacg ,*,". , 

&09mning dT 6RF 10--- > 
13741 Dcot^dcgcg OQcag&Qa^c 99C99t99Ar C;iTGAACG;iCl fiC&GC^ClC AGTCCCACCA laSCO !: 

• * - , 

13801 GSTGdCACtG GCCrArCCdA rciACC^G^T CTGCCCGCTC BArCCGCCGC tCtAACTCCi UBGO v ' ' 

t3B6t CGGGCTGCC9 TCCCAGAACG CCGCGAGCCG QGrCACGCTG TCfiGACQCCA QCCAG^tGffi 13»20 



1392 r GCr6QTIiA[:c rCGCACGCCG aCGCCCGfiCC CftTCCKGCC CACCGCCGC^ fCACCCCQCT TSidBO 

\3&ai tACfi-AftCfiCC KCfiacrtCC CGATCCTCAC CCGCACCTCC CAAC7fiGr6C CCCCCAACCC iWO 

HOCIi BGAQTCGGCC TCCnCATCC GCATfiCACfiA CCC&CAflCAC TCCCGGCTGC 6CrCCAT4Cr 14100 

14101 CACKCSGAC TfCCTtWlCCC QCCGCGCCBA DGCfiCTGCGC CCCSCGGTRt GCGAGCTCCr 14K0 

14161 CGAC«ACArC CTaSttCftCGC T^GT&AAGG& CaAdCGOCC^ QfCGACCrCG TCCCCGOACf 142Z0 

1tf221 aACdATCCCS 67GCCCTC8C frCGTCAKAC CtT^TCTFC CtacaCCCGrG AC&ACC«CCQ (4280 

I42ai QGACnCATC QAQUCCQCA GCGCGCTCCT CATCGiACCGC GGCTACACCC CBOAGCAGCr H340 

1^341 CGCCAAQQCC CGGGACGAAC TCOACQGCTA TCTGCGGCU CTQQTCGAGG AGCGGArCQA 14<I0Q 

iq^Ot UACCCGCCC ACCGACCTGA TCACCCOGCT CGrCATCCAC CAGGTGCCQC C9GGCCATCr 14^160 

1/^491 GCGCGrCGAG &AGtATGGrCC CGAT6T6CCG OCrCCrQCTC GrCGtCGftTC ACGGCACCAC I4&20 

14&21 CACCAQCCAG GCGlCCCr&A CCCTGCTCAG CClBCrCACC GACCCGGAGC TCGCCCCQC& I^SBO 

H5«l CCTCACCGAG CACCCGGCCC TtCTCCCCAA OCCGaTCCAC GACCrCCF^C OHrcCACTC 149*10 

14641 CATCflTCCAQ AACGtGCT«G CCCIiTCCCGC GfiTGCAGGAC 6TCCAGCTCG ALMTGTOCT HTO^l 

14701 CATCC&ecCfi GGCOAGSaCG iCGrfiCTftTt OCTGTCG<ICG Bt^CAACCGGG ACGA&ADGGT 147aC 

H701 CtrCCCCGfit tCCGACCGGG TBGACBieGA CCCCCAWCC CCCMCMTC TCGCClfCfifi 14B2D 

11821 CCACGdCATG CACCAUrffCC TCGqcCAGfC CCrCGCCCGG CTGCAGCTGC Al^CAGATCCt 14880 
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|11 / 28 jg 



1t3SI CGCCUCiSffrS CTSCQCfEiiJA TwCCCuGTCb 
1 1*91 1 CTTCCCrCAT GJkGGTGTCCA UrTACGGCCT 
\60Q\ glggagcgqc tgaccgtcg't cct.cgcicgcg 
\50ai gccaccigccc ccciq^qUI. 

I to I 20 I SO 



^ - 20 

CCQ&CTCwCu QTGCCCTtCS ASGAtoCr&dA ^ii9<iO 

erxS of ORF t(?--> 

CIJGCGCCCTC CCaCTCALCT fiGTGAQcgQC \50Za 

^cggccVqcl yi:ScgotQg9 qcgctgcqcg 1$060 

l£f079 

I I 50 I 60 
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13 

program aot-vrLt^td above. llie AUH tfoqvi«rice iia<i provlouely 
bOCt) ftLlcfJiftd wttli the three ARC e«sqtjences {Szumanskl £t 
HQyL« (If^OJ, J. Bjct^rioL., V* 172 > pp. 53€-547)* 
Idon-tical matCJicn in tvo or mora soquencea are indicated 
S with upper caB* aetteus. 



BxamplQ 1 

Genooiic DNA preparations frofti Vftrto^a Strep^t/att^SJfiS 
10 sp^cios wer« Isolate a as ci«Si;rib«fl by aopwooa ot al. 

(JL585} . *'or intersijccicr. nwA hybr idieeition analysis, 2.0 
;!ig amounttt oi gonOTliJc Pf^A prepttrationP vera dit|«i£t:&d with 
Weal for lOh, aK(i electroptiorcsctS in i.Olt agaroso qrale. 
The scpara.te«X uKA fratfiBontd Vfo-re. then transiarrett onto 
15 nylon merohranss (Hybond-n, Amsreham) an<X hybridized witti 
a cXa fipeoifia prob« prepored by labeling an internal 43» 
lap sail Craqrnient CPig. D vlth («-*'ioaArp by nick 
translation' KybridiisfiLtion was dona ds d'Oficrlbed by 
earftbrook st al., . Hybrid! Station TSieiinbran&B varo 

20 washad twioft t.av 3 0 min in 22E 0.2% £jDS tind once for 

30 »in in O.LX 33Cf 0.t% SD5 at 6S*C. 

^^W ^V j Ct^s* h oao a oaou e . t o _ Cila In other St i^^pi^emyeGtea 
Itirea at eix producera /9-lactaw aijtiloioticB,. 
25 clavtttiqeru g. S- Llpmftnii and fl^ j^monj 1 nensia allowed 

positive hybridisation signals vnnrceta s. catt laga^ 

grlgat>« - and K. lacta m duranji dl*l not (data not shown) - 

Kors of the nonprocJxxciuq etraine s«&Biinear 5> veneKjualAa. 

S. jl^^ ytdang . S. fradiae . S,, a ntlbtnticua and g*. 
3 0 ^rlaaofuBcua gava ohy s^i^nal. All of tlio sAtraptomycatse 

that 9aV4 poftitlvo signals ivorc» prodweers of clavam-type 

ttiatabOlltC!?* (KLson et al., 1987) 



i;:itaniala H 

33 Dl ffp.iptl<iin of the aa nom ic cla gene 

A 2.Q Kb Mcol Crac/meut that contftjred tha entix-« pjfl 
9ena wks digt^St-ftfi at Its unique Kon l site and tho «n<ta 
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tiiade blunt by tTvatmei^t with tha Klanow fragiaant OZ Ej. 
TOli DMA pcilymcrase .C. A tJiiostro-pton resistance ^ene 
( tor ? , ifjolatecl aa a 1085 bp E.qll frogmant from prO'702 
ana cToned into the BwdHT si.te &f ^lUCllS was ejccisod aa a 
5 -Sma i /Xbal ireitjmonC fl.ucl the ends UlAae t>lunt. as ttJJCve and 
ligateSttJ into th« LCpnl sit© of da . Th.© licrrttion mixture 
was Introduced into R. col. 3 MV1193 and t.l>e tr anef orjtittPta 
screened for ttw pr^-Sftjice of ttic Hjail qfeue by COKOAY 
hytotidtsattcn (Sambrooh. ot al.*, 1989). 

10 Peplacsntent f^t the chrcmoBcundi ol.a gena by a copy 

diarupted by thC inaesrtion of tar , at an internal KeqIj 
site, vas achieved by double recoJ^binat^ on* SucuaSfirul 
gene rep lavement was apparent When the 2.0 Kb HGaT 
fragjD^^nt whlci\ carui-giEi t^ia In the wild ty^e ^r-ganism v/n« 

15 rcp?.ac4d by a 3.0 Hij Wco l fragment duft to th© insertion 
tiie 1.0 kb tfir qene in tft^ jnutanta. Four Of the five 
nut&nts ter.ted eho-rfed the expected increase lu the si^te 
of the Hco J fragments, and the lauqcu' flCfil fragments ftl«o 
hybridized with A t&r specific pro-bc, T'he fifth mutant 

20 vas appar^ritlY a spontiinQooi; tlieoatrepton reaJetant 
mutant . 



AotibJ 

The agar diffusion, assay vae usad for determining 
25 bptti penicillin^ ccj»hauiycin and clavnlanic aoid 

praduution. s. clavuliqa rag strains to ba as^y«d vere 
grcwn 10 ml. amounts of Tirypticase Soy TSroth (T3Bf 
Baltimore BiolO^icai Laboratories) medium -with 1,0% 
atdrcn for 4flH. ITie cultures Tfei:s waah?^ tvica vlth 
30 10.3% sucroae and Oftco v/lth KM (Jensen ct al . (1962) , J. 
Antibiot., V. pp- 4Q3-/I90) and the laycBlium 

reauspetldftd in lO.tl mL of KN. Tvo rolllilitrcs- of vcaBhad 
cell suspenaion tratf Inoculatsd Into l<iO i«r.. of mi and 
lacuhated at 28*C f^or dHh. TJie cultures were harvested 
3S by centrifugAtioH; and the supernoteintif W«P.re aeaayod for 
both penicillin/COipUawvcin and c;l«vuiant.c acid 0«in? 
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19 

blDaasa/ procedures desuril^eU previously (Jensen ct «J • 
<i&8 2), suiH-riJ . 

Aii or the rosultincj colonies «itti alarupted cla 
9«ne& grew equally v?ell on roinimai mfrdiuw and coRplOK 
!5 media and produced a'J nuch pcnlciLlin C^pH&lliyclTi aa 
did tUo wtlcl-typ/^, but produced no ciavulanic acid (datd 
not sliovn) . HPt.c annlysis tjf celX aupernatants (Jonrlrmed 
the inability of Uic iHcruptei fiia mutants to synthfiaSa& 
txny clavuian.lc arid (data not Bhown^ , 

10 

example 4 

£rg>tQRlagt formation and ti-j nsCgnMSOfia 

ft. eoli competent (^repArAtion and 

tranaforsiatlon were as deacrlbad by S&mbrooK et a1.*j 

15 <i&B9). ProtopIgBtB of s. clavui lqarug; vera, prepared, 
tratuifortiied and regenecAt^d as dciscribed by £lailoy ot al. 

Hio/TttOlincilogtV/ 2, pp. 808-811, wJ.th tba 
following nodlCicationa. Dextrin and arqinine in tftd 
regeneratLon medium wt^to tcsplAced by etarcli nJld SOdtUW 

ao qlutamate respect Ivoly, virotoplaeta t/erc heat £|)oc)eed at 
43 "C Tor 5 min |*irlor to the addition o^ P«A. Standard 
procedures Vfars used £or prcitopl-^ating and trans £om»t ion 
of a. liv idaJta (Hopi/OOd et . <LM5)J. 

TUe 11.$ kb.J2SS9Ra tr^gmant Crota K6L2 <rjLq. 9) wad 

25 Clonod into the KcoRl site Ct£ pCAT-119» pCAT-llS is 

derivative of pUCllR vlxich vaa prepared by insertlonally . 
inactivating the ainpici llin reifi^anco gene of. pCCilS by 
tho insertion ot a chlorftmphanionL acetyltronsferasc gene 
{jTcTiaan et al. (1989), Genctiea & Kolec. Bioi. Ot Cnd. 

30 Hicroorg. , pp, 2J9-?4i; Kd. t3erchb(ir9«r , ^tner. floe, 

WiccQbiol), The PCAT-H9 plaaraid carrying th« 11* G lib 
fragment wa« th«n digeeted with PistZ ar4 liqated bo th« 
atreptOTBVces plasmid pIJ7Ci2 r Vbi-cb had alao be«n diefested 
vith Pafcl. Ttiv rcsuXtlnq bifunctionol plapffltd carrying 

33 tbe ii.SKb insert vae capable of repllcatli>q in either H*. 
gffii (with R&Xection fotr cbiorniaphenicol reaietanoc) or 
In K. lividana (wltb selection "for thio»txepton 
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cot i ■ pjAftinid 0?IA wae isolated 2X^f«i aeveral o/ tiOQ 
chlaxamphgnlcol tuc:|.&t.Anc tLranstorEnsLnt^ And. analysed JSy 
a«fAro$0 el ftcti^ophoreeis to on^mre that the plC&p«r 

S pl-a»Tji3rf conBtvuct was o3t>t«in<if3 . This iaolettod p\asmxd 
material from B. <^ol1 - was then trantsf oraftd J.nta S*_ 
pit vldans a? <icr.crT.bedl lay Hc^pvooct flin<S tTansf t>jft<ints wera 
selected by pLating onto R2YU roedium containincf 
thioBtxepton at « concentration of 50 w<y/ml. 

LO Ttiiofit-T^eptou reeiatant 6 . livid ana trans formanta 

carrying the bifunctLunai plaawLd -with Uio 11-5 kb inaort; ^ • 

vere patched onto MYM agar* platee and Ailovred to incubal:& 
for 43h at 29 "C b^rore they varo ovorlfKy&-r&& wi*hmolter» 
So£t nwtrl-ent aqror contaLnin^ penicilLin G at A 

15 conceiitra-Lion of 1 ^9/^'^- a»d inoculatod wit^U 

^ojf'h yloaocaiiB. aureua u-2 «s Indicator orgnnisjn tJe.n»«n, 
1902) . f S* aureus H-2 Wft!> obtained from tnc Oft.pAr±,7nsiit of - 
Wicrobiolocry Culture CoJlaction, University Albertct. 
Any otgani&ia c;hlch produces a j0-lELOtawa&e Bene-itivo to 

20 t^lavutanlc acid aay bs used as Indicator orgoirjiffiQ* ) 

Zone& of InhibLtlon appeared around tho fi^ aividana 

colonies upon Inoxibatlon overnight *r. 3Q*C vera evidonc^ " r 

Of alavulanic acid produotluon. clavulanic aoLd- ' *' 

producin<) colonies were found atsong'st these initial fi* ' 

i5 J.;^yidanr. trauaformanta at a frequency of about 12% * when 
pla^iQid vtth vaa isolated from one of tii^^a clftvulanlc 
AC id -producing trans fonw»nta and re-lntrodiiced into 

lividon a. tJnj Cr-etquency of clavulanio acid production in .?];.•, • ' 

tti^so 2nd rognd transfotwants wa^ *bout 4a-4&%. rigure 6 
30 5t)ovs 4 photograpli ot an agar plate bearing :?Jid. round *•" .' "• 

transforcaantc . T^oiiea of inhibition- are aeen as clear *. 
areas in the agar; tho£;o appear on the photograph as darK 
circular ar^as. 
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fiCqu^inuiny of ,t5 DHA VrACjm^.ni 

Or<lorc^ 0£ il^l^tiona were gBiierOtOCt as 

aeacTibed in Hxampls a weinq fraqiagnts of th* PKR insert 
i from the coemid clone K6I»2 (Figure 9) and siibcloned into 
th8 E. coli plasmias pUClia andpXJClls. overlappinij 
CrRgmcnts wer$ chosen which extended from Wfc* end of the 
EgbC gsRs downstraajn for a distance of about 15 XD ending 
at the EgllT Rite, vhe deletion generated traqiaenta wore 
\0 eequenced in both ori*jcttationp as cieBCTibed in EXdJnple ).. 
The seguenCO iit Shown In Fi-qwre 2. 

the present invetitipn Is not limited to tlie fcaturea 
15 of tno omtFOdlmftAtB described herein, k>Ut 3l/n:)*v3ea all 
variatione and modif lent ions within the scope of the 
claina. 
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ORF# 


S^2n location 


Ent3 location 


<bp) . 


5 IK of ORF 
(aa residues} 


1* 


10? 




l«5« 


S52 




2216 


3^37 


1723 


574 


^ 


3940 


34B1 


1542 


$U 


4 


5634 


6393 


M . 


314 _ 




(511 


75fi8 


978 


32^ 


6 


7S95 


9076 


llS2 


JSM 


7 


9341 


10 908 


166$ 


S56 


8* 


xom 


I2 29« 


1299 


433 


" 9* 




I3 3&3 


744 


24S 


10 


1J769 


14 9?5 


U27 


409 



^ Actertskc deruJtc ORFt which uc oriented in die oppodtcdincttoft. 
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FTOIIKE 


^ - i 








1 

1 

$1 


I 10 1 20 


( 30 


t MO i 60 1 qd 
C9ti9QaQ9aQ qc:93qcc99t cgtccicclcg 03 

fFnd of ORF ^ 
eggtgggtgc qQn^ccc^qTC AGCCCCCCCC 120 


■ 


121 


5A5GTTCCrS AGQAACTTCG 


C&GCGAC6GB 


GtCtGCGTCG CCCCCBCCCG 




lao 


*-•. 


131 


CTCCAGCAGG ACCGACCAfi^G 


CCArGfTCCG 


GrcccccraG tagccgatca 


rCCAGfiCfiTQ 210 




241 


ceTcricccc cecTTtTCftu 


TeCCGWiCTC 


CCCGCTACCG GTCriGCCGT 


ftCfi&CTGTCC 


30!) 


* 


301 




CGCCGGCGCC 


CrCGCTCACG CrCGAACGCA 


rcArcbAACC 






361 


CACCGAQTCfi ACfiATGCCCQ 


I^GCCCATCCC 


QQaacccT^G TGCGCcncr 


TOACCfiCQK 


«o 




A2) 


qGCICJ.CC AG C ftCG&G&TGCT 


TGAACTCGCC 


tTGcnCACC CTOGCSGCGA 


TBCACGCCAf 


480 




lei 


CACCABGGGC GACCCCrCGA 


CCDTGGCCra 


TCCGATCCTC tACGC^GLCT 


IGKGTTCTC 


540 






GCTfiTTCGAG ACGGGfrACGC 


rCCCCTCOAA 


G«T«&AGKCG tCGACGTCCC AQGTOCCGCC fi03 




eol 


GATGCCUAG GCTTCfrGtGC 


CC76CnCAG 


GCTCGACrCQ GAGAGCfTGC 


T^liGGAGTT «eO 




€61 


cacqaa<:aac crc-n&CAaQ 


ACTG&GCGAA 


GCTGTtCC&G AAGGTCGAOC 


CCCCGCftCAG 


720 


• • 


721 


CGTGAACTGG TCtTCfiTTCT 


CCAAGCTCTC 


*CCGT70ACA TCGGCCAACT 


TCGG^CAQTC 


780 




741 


ft&CCL&CTCC TCCMQTTCfl 


TCCCCfGCfG 


feAftCAGeGCC GCCGT08TCA CCAOtTTfiAA WO 


• 




ftetCGACCCd GQCG&GTAfiC 


GGCCCTCCAG 


CQCGCffQirC AT«CCGGA6G 


GCACCTTCQC 900 




99 f 


GQCeaCCAQQ ATDTTQCCGQ 


TGQCQGGGrC 


GAtGBQGACe ATCGCCCCQT 


rCTTCtTcGA «S0 




961 


GCCCrCCAGC feCCGCCfiCCG 


CGCCCCACrC 


GACCCGCfiGG rCGATG^TCO 


rCTTCACCGG 


1020 




1031 


CiTGCCCrCG dGTCCTTGA 


fiGtCfiGTCAG 


CT7C7TCACC ACCTCGCCGfi ACTCAtGCTC 


10B0 




108 1 


CAQSATCACQ ACCCACCGCC 


CCQCCCCSGA 


GCCfSCCMTG AGCTGCnGT 


CGTAGCfiCGA 


nqo < 




1H1 


CTafiA«6CCC fiCCGAGCCCr 


TGCCCCTCCr 


GfiQQTCGACC QCGCCGATGA 


TfiOAGQCGQC 


120O 




1201 


CrCfiAfiGACA TTGCCGnQQ 


Cf^rCOAGQAr 


GTCCBCQiCGC rCCCGCGACr 


T«A«GOCGAO 


1230 




1S»1 


GCTCTftCCCC CGAACCATCT 


&CGGATG«/.r 


CATCTCOCTG nSAACBCCA 


CCirCCACTC 






1331 


Cn&CCQCCG CCGACGApCT 


rcGcccrBG& 


GTCCCACGCG TACrCCCCGG 


CCCCCGfiOAG 




* 


13&1 


GGKAnCTQ ACGCTOAACC 


GTfcTCTCCAC 


crcGcccrcc GfiGrrcTTcr 


cccccercn 


m4Q 




m\ 


GSC&GrSATC TCCflTcriCG 


TCG«CTTGAG 


atTGCrCATG ACGBATTTGA 


TCAQCCACrC 


1503 




1501 


Cftci;Tr«rcc GQoaTGTccB 


rCAftCDCBGC 


ttUCWKG«G CCGTCGCCcr 


7CTCCCAG6C 
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168) CCCWOGAAG GTQIC&WCT CTCCGOCCfrC CCCCTCCACC TCGQCGJCCt CtGAATCtTT 1620 

leai cTWtcMCA AtcwGcrii* fGiajicctcA ATACtcuAGt tccACCGTCA ccoccAQLCCc icao 

1681 CaCOACCACC QCGQTfiGCC^ CLCQiiCCAC-G nCfcCCGficCt tTGCCUGCG GtOCCTCArC 1710 

* --B^ginnfng of OAF 1 

I7M1 GCCATAQTTd' fKGAArccK rcA?ggggcc aqgclatgcf) Qgcgctctcl -tt.cccUxU ^vm 

1 801 ccc990-tocc 9C9tUc<39,<3 ocflgt':Q099 g^ccgaacqg ogq^ctgyac co9ccgcico 1860 

^881 9cgQ«ec9it cccocctc-tt 3999^9^00^0 ggcacccgga cqs'gocco"* gqcQ«atcc 1*20 

\9ai ot99wnOT9 Q*«j«90flt,<:g ^■tcoc':?':)^^ tf.q,ccge.5a\ iggcfglogoc ctrAanaagc 1880 

iWf 9t.qQ«i«3cqg qg.uglogt:9a coooocgg^c ogdcccctga agggootlia clgaoltcgo 20^10 

20m gtcGtcgg^t tcQ^cgocgg algggcggtt c^gcwcijM ccgUactct tcgtceccic 

2101 ttcacaagoQ cicccggtac gtqgagaaga Qogcgtgoog ^CfCigcgtccf] ^tcaggqlig 2160 

B*gin- 

Z\^S tcgqgaffccg tccoccatga cggogcctgg tac^gocgga g1cg9gog« cgiitcATCTC 2230 

2221 CCC?GTflTCG ACCtCCCCCA GCGCCAAGCC TACCGCCGC7 CACGCtCTCC TUTCACCOTT 22H0 

azal QCGTGArCAC GGTtiKiliCtiA AGSTRTrlGGl (idriGTCGCC CGAGACGCC5 CQEKGATTCT 23^0 

2311 CnWACGAG GTCCACCCCA rCBACTfCST KTCACCCGC CACGAUTTCA CCGrMOTSF 8403 

2m\ CEccacrcAT iKCTcecGt ggatcaccgai iCdCCcctAG ccctgctggc ccaccctgbg 2?e& 

aWl CCCCMTAM ACCAdCCTCr CCACCfiGUr CCCCACGTCC C7CCKGACC GCTCfiCCCCT 252& 

£531 CAFCCCGCTC CCCGtGCA5T CCCACTCCC* CQACA7C7TC CCGAAMAtA CCCACCAGtS 2560 

258 f tCTfySACTCC GTGfiCGATCC ICfttCCCGAT eTCCTTQfAC GCCQHSBAGt TCCAGCWCC 25^0 

W\ CCAC&AGATC ^CC^ACCTCQ tCWCTCCC;; tiJT«AAWCG GCCATCACCQ A«tG«TCaG 2703 

1701 aCCCTCCTTC ArCTCCCTCC CGCTGSACCT CCTtQCCTtC TKGAGGCCA ICGACACCAC 2740 

2781 CQTCCCCAAC CCGCCMCGA ACACCCCGGQ CAAACCttCrC ttGtCTCOrCG CCfiACCfiCrG 2620 

2821 CCAGAACCCC OCCCACCAQ« CCGCCGCCC7 QCTC^CCOACi 6CCAASCACC CCGr(*CTCfi7 29SO 

2B61 Ca7CGCAHCG GC.CGCG*TCC BCrCfiGBCGC CfiTCCCGGCf A7CCQC5CCC 7GGCCGA0CS 2ftW 

SMI CC7GAACArC CCfiGTCATCA CGACCIACAT CDCCAftGGBT BTCC7CCWG TCBGCCACQA 3C00 

3001 GCTCAACTAC CGtCCCOrCA CCBGCTACAf GGAi;CGCA.^C CTCAACTKC CGBCGCTCCA 30^0 
3081 GACCATftTTC ftCCCCfiCrGC ACC7C07CCr CACCG7CCCC FACGACVAtG CCtACGACCT 3120 
3121 GC5WCC7CC AICTGSCAGA AGGGCA7CGA GAAGAAGACC CTCCG7A7CT n;CCCACCQ7 3130 
31 Si CAACCCGAK CCCCeCCrCT ACMGCCCCA CCTC5ACG7C QTCACCfrACC rCC7tGCC7T 32*(0 
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32^1 1 




rrcGAfi/iCCB 


CG&CCUCCTC 


CTtCGGC^CC 


AAGCAfiCGCC 


ACGACATCCA 3-303 




330 1 


GCCfiCT&CCC 


ccccecArco 


CSGAfiTTCtT 


BGCCBACCCS 


GAGACCTACd 


Av, 1 




3-381 




CAfl^TCArCG 


ACTCCATCAA 


CACCGTCMG GAGWGGtCG 




■ 

* 




CGAGCGCACfr 


ATCGTCTCCG 


ACATCttGCTT 


CTTCCfiTCAt 


TACtCTCTGC 


ICTTCGCCLG 3480 




34a 1 






TCCfCAL^CTC 


G«CGG(SCTCC 


TccwcriCG 








ctcccctccc 


A7CQCCGCCC 


ACAT Greece 


CCC^GJtCCAG 


CCCACCTTCC 


TCATCGCGGC: 2600 






TCACGGCCGC 


TTtCAt f L-CA 


ACAbXTCCGA 


CCTCQlAaACC 


AlCBCCCCGC 


TCAACCTQCC 3660 




wDW 1 


CATCftTCiACt 


frTCGTC&rCA 


ACAAC^ACAC 


CAACtfG£CTG 


ATCGA&CTGT 


ACCAQAACAT 3720 








CGCABCCACC 




cAAcnraGc 


GGCGrCGACr 


TCGTCCCCCT 37^0 


-' 




CGCCGACCCC 


JIACOGTCTCC 


AC<1CCA«CC 


CGCCACCAAC 


CGCGACGAGC 


-rBCrcCitCCC 3840 




asqi 

39D1 


CCrffCfiCAAQ tiCrCCC&AGC TCCCTCGTCC 

Eno' of 

CTTCCAtJCCG SSCCGCfTW fiCECCnCAG 


CTTCCTCATC GAECTCCLGG 
OfiF a--* aoglnnlno of 
CATCrCAtcA TQCt^GGCACC 


TCAACtACGA 3900 
ORF 3"^ 

GCTftirccc iseo 


-* 


39S1 


CC76CCTrCG 


GfiTTCCrCGC 


{TCC&CCCCA 


ACliGGCG&Cfi 


GCCfiOGCCCC 


C6GCPCGGK 4020' 




■11021 


TTCBCCAKC 


GQOGCAGCCA 


CACCGACAfC 


GACACGCCCC 


ACGGIGAaCfi 


CTCflCTC&CB <i0S0 






GCGACCCT«5 


TBCACftCCCC CrCGBTCGCQL 


CCC^ACCBCG 


CBGffiGCftCfi 


CTMCTCACC 4t^0 




4Ht 


a<2CQCQCCCA 


CCACC^CGGT 




CAGA7CTACA 


accbbgacga 


ACTCcrcrcc 42oa 




^20t 


crccracc^Q 


CC&QACCCOD 


GCCCCACGCC 


CACGCCOAfiC 


TGCTCC7GCG GCTGCTCGAA 


V ' - . . 


4361 


CQCTAT<iACC 


TGCAT«tTr CCGacTTClG 


AACG&GCdCT 


TC^CCACCCT 


OGTBCGaACC 4320 


1 1 • * 




GCGCACCQUa TCCraCTCQC 


CACCQACC/^C 


OCCGSTrCGC 


TGCCGCTGtA 


CACCTGTGTQ 4990 


t' 
»^ 


43S1 


QcacccAGca agctc eristic 


Gf CCACCf^AQ 


GCCAAGQCGC 


rC&CCfiCGCA 


CCOCMOCCG 4fl(W 




4^1 


AACCCCTTCC 






crcGccccrc 


TffACCCcrat 


CTACCA6GTG 460D 




4601 


CcceCG^GCG 




CAICGACC7C 


GGCTCGGGCA 


CCBCCGTCAC 


CCACCGCACC 4590 




4561 


TGaACCccCG cccrcrcccD 


CCGCATCCTC 


CCGGAGGC'Cfi 


AGliCCGTC^JC 


&CCC(;TCCfiG 4620 


i\ . . 
- », *• 


4021 


CDCCCGCTCG 


AGAA&fiCC&T 


CftCCCAGCGfi 


crcAcccccc 


CCGACACCCC 


GrTGGT6GT0 45«l} 




W} 


CTCTCCBQCG GftAtCSACTL 




GCfiGCCrSTfi^^ 


DGCACC(^GGC 


CQCCQCBGAA 4740 




m\ 


(^TGSACACCG TGTC&XTCCG 


CACCGACACG 


TCCAACQAGr 


rCCOCGAGGC CCfiGfiCGGTC 4600 




^Q0\ 


OrCGACtAtC 


TGCGCACCCG 




AtCACCATCC 


tGACCftCCQA ^CTGCTBGCC 4660 
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PTfJuns U - i| 



<\e61 CACCTCCCG7 ACGCGGItlft fiGCCICCGAG 

192) LTCCCICCTGA CA.<>CCCTCTA CC&GCDSDTC 

<(&et TACGGCttCGft ACATCCCCCT CQRGCGCATG 

so-Ill ACCGTTCTCa rGWCCAtAl G^tCACtTTt 

ClOI ttCJltStTGfi c&gi;rcac7q daccaclcac 

5161 CrCGTCrCGi: ICGAfiGitCG GC7CAACCCC 

5321 CCCQCQLAtGO CDGACKCCT CCfCG^CCA/i 

5281 GACCCC7CGG GMcciCGrc crccTfcrcc 

53<l) GACCGCG7CC ACCAGECSAA Vl^fiGC&GGTG 

G*[01 GCCCGACGtlC ACCCCfCtGA aGTGG^lCACC 
End of DRF 3--> 
A^CQCGCGGa GGfiCUCCTr^ Gtcccqccac 

SS21 94<e^e^tac ccg^^Q^cgc e^geg^octc 

U^^^^nceg tcg9cc«t<:g gcectcQcgq 
Beginning of ORF 4- 
^1 oggcc^ctgaa agc4C€«A«iC QCAlCfiACTC 

5T0{ CACCncdTG KMTfltCQC ACGATCCqc* 

ETfll AQCCCCCrAC aACGGSGftCA CCAGCTACCQ 

6«1 CC5CAG7GAC TCCGGCCTCA ICCACCaTGT 

Saai CCTQATCAAC TGrGTCCACG CCQaOQACAT 

QATCfkACAU ^CGCAQAGCC A7CTGTGGGG 

eaOl GATCfiGWCC aACCAClCCC 7flACCC7ftCC 

90$t CCCGC7CGCC GTQGTGCACC 7CCAC6G6CA 

a tat CCWTACCXt'CACfiCCACCC CmCCGBCA 

&tat OCCGAIQ^^C CACATCCCCA TCCfiCOffCCA 

B24t CCCtlGGCCAC GCCSTCCCBG TGGlCACfiGC 

8301 ^kCCdCCHkQ CTGATCCGITG AQAftaSTCCG 

6301 CCACCTCGIC QACCCCCCCf TCGCCCCCGG 

e^si ^rtcdcfiCGAG crc-cTGGcoc TGcrccecrc 

6qB» CGr&ATCGAG GTGTC/iCCCC TCTACQACPA 



TCCGTGOACC CdBACAtCAT CSASTACtTG ^920 
fiACG&GCCGfi ^.UCOCCCC^r CCTCACCGCG 4980 
CACCGCGAGB ACCeCCTGCC CtCGCKCAC SO^IO 
GACQGCCTCA ACGAGATGrC CLCGCrfiCfG 5100 
CCUTICTCCG ACCCQCAfJGT CCTCWrCTft 5160 
CCCCACQGCC GGGACAAGTG GGTCCTGCGD 5220 
ACCGTCAACC CGCCCAAfiCT CCGCC7CCAC 5280 
CCCnB<:TGC TGGACCACG& TfiTCGCCQAO dS^O 
ftT^CtCCACr TQTTCGATCr CACG&TCGGtt 5^03 
CACCATCrGC TfiCSCTCCfiT GGtC&ACCGfi 6^tiO 
ggggagcccg cciyijuCcjCO^ ^acccqcqcg 5&20 

CQQCQCacc^ Qcacccctgt cccccacccq S560 

cccc-tqacga ccgVc^ccc^ ciMcceaQgo S64C- 
■ > 

GCAMTnCA CCCCCCTftCG CACACATCCC 5700 
GCCCCGCCGC TATfiACCTCQ TGUTCAtCGfi 5760 
TCCCGGC^iCC CQCrTCfiGl^C CCCAQSCCAf ESiSO 
DGfiCATCGAC CCCC^CCCCG GCACGHrCA 5690 
CAATCrSACG CCairCGACA fQAACATWC fiS'K) 
CCTftCrgAAG ftCCAACGCCC CCTncTGAT 0009 
C&CCC7i;CCC tCCCTCGCGC. AGCA6CACGG. dOfiO 
CrCCftACACC AACCCGdCCT TCTACGGCQ^ 6\20 
CGGCATCGAC GAGAACCTCA TCGACCCCGC QfBO 
CWCCCGAAG CCGGACTCfiC TCGACTACGt e^^0 
GGACGAGTTC G«CGAGCT6a «G6TG6GCGQ d^CO 
CCACCGGCCt: CTG7ACS7CT CCQTCBAtAr 5360 
TACOBflCACG CCCGCGCCE* fiCGftOCTCCT 6420 

ccrcGCTGAC crcAAGr;cDe rcGCcncGA swo 

CGGCGUGATC ACTTCGATCC TCGCCACGtiA 65^0 
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GATtt^CrCCli 
&601 ocatcgtgtc 

6891 OCTC&CCTCC 

6721 CAAGACi:cr<} 

6781 CGTGGl^CAaC 

«e1f CCCCGAGACe 

.6901 CATfiCrCGCG 

ad&l GfiGCiVCOGTC 

7021 CrCCOA^AM 

7061 CGTCAT^CrO 

Tt41 CCrCCGCMQ 

720 { CGTCCCCTGC 

72ei CAAcarcA/kG 

7d21 BCTGOCGCCC 

7361 CCATOTCACC 

7141 CACCACCCAC 
7SC1 CCQC^TCTAC 
7WI CACCATCTM 

7621 .eggoaccggc 

.769 f .t^t^CCQ^gt 

76Q1 acpcc9\9co 
76Q1 tcgccgl^cc 

7flai ccGcccAnc 
Teat c&cccrccTc 

5041 CQCCCGGCCti 
6101 GCTCCTCCra 



El«glnnln9 &f OR? 
ArCGCCTCTC rCATACrTGA 

GAStTTCCCG AfiGTCCCGC^ 

frCCGCCCGfC TCCCGCACGC 

MCGACOGn ArCrGCTGCT 

LCGACc rc CA ccc cccc cc c 

CTGGCCi3tiGC dCCOCCTC^G 
TAWWQACe TGTACCtGlC 
CTGCUGAGT rCCACACCQA 
GCCTfiCTCCC QCGCGCACCd 
CWrcCCCC TGCTGCACGA 

rGCCTiniACd TiificcrTccc 

DCGCrCTACfi fiGfiA^GCQAA 

GAGSACCCCQ C^GACAAQHiA 

GfCGG^GKA ACCTCdTCCC 

GCGCGCACCC CliTTCrCQCC 

ArccacACCQ accccaatqux 

End GRP 5--> 
TTCTCGCCQC GCCGCffrAgc 

'tccc9gtcct 9ccecetcac 

tadtiititgcc actctot^cj^ 
ctttccgtga co^gcgocgc: 
GrGcrGCACA CGGCGCCCCr 

GCCTccAcce cccc cccc AC 

AGCGTCarCC TQICCCGGGA 
GCCCGtAitCG CGAATCTC^^C 



pMti of ORF 1--> 
MA&CCCXC AGAACCCACr 7tTSAd$g^4 8600 

CTGCACCCCG TACCGCGACG AGCrGGTCfiC 6660 
CfrCCGACCTC CA?WCTTCC TC^ACCAGCC 6720 
aCTGGCCGCC CCTCTCCACA CCITCaACGC 67S0 
GCGCffG^CTG CCCGTCCAtG ACAGCGAGCT e8'*0 
GCTGtACCBC AA^CCCrTCR T&ATGGAGfiC 63C0 
TCrWACACt WSTACCAiXG ABCTGCBCfC 6«60 
GCCCG^CGCG CACTACCrifT CCTCACACAC 702l> 
OATEGCGrAC CACATCCTCC AGCCGAACTA 7090 
CUAGAACCCG CCQGAGACGC TG-GTCCGCTC 7\^0 
CAAGACCCGG GCCCCTCTCr TCGACC4CAA 7203 
CGbCGGG^TC GACaACCCGG GCCCeAfCGC 729^ 
CWCCCCnc CTC^CCTACa ACCGCGACCT 732^ 
GftCCGTCGCC CATCTCrCCC AGGCGCTCGA 7^60 
CGOTOACGIC CTCATCATCC ACAACTfCCJ! 

CCCCrGGGAC 6GGAA0GACC GCTGf^CrCCA 75DO 

ACAGCTCrCC 0CC4GCaA«C GCCCG(}6CGA 7Ee[> 

ccggctcccc gqg^iccctgg gctceg^cgc 7620 

ccgccgcgp^ ^^i^^^^^^q^ cog^cccclt 7680 

gceggg^cgg gggygacggc ggoggt^ccc 77^0 

tgctf^ acag cttctccQiqt gccgCgegcc 76C0 

Boqlaalyco' gogtocQQCg ggCgaggccg 76dd 

Beglnnrng of ORF 6--> 
lgacA7tTCC GACA5CACAC CCAACACDCC 7920 

CGGCniiGCC CACMCCCCC CtCACGACn 7960 

CQTQAGCGCC CTCTTCACCC GCtCCCfiCn 

fiGCGGTGCCC GAC5.GGCAGG CGCGCGGTCiT 6t00 

CACCGttCCTG- GACCCCCAGG AGAAC5CGCG QtfiO 
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i'luxjiiK a - ^ 

EmJ of ORF t|--' 

(iS^^ GAJt6\i\t^fi OAACTCCTCr ACCACTACGC CCSABCCCAC AGAACCCAGT TffTSila^gaQ 8800 

&601 ocQtcgtgtc ArCGCCTCTC CGATAcrfGA CTGCJ^CCCCIi TACCGCGACG AGCrcCTC&C d<$0 

6981 tJCTCacnCC GASLtlcCCG AfiOTCCCGM CCCGGACC7C MTOtiCTTCC TCMCCAGGC 6720 

6721 SfkAGAtilCT6 4-CCGCCCGrC TCCCCCACGC aCTGGCCGCC GCTCTCCACA CCITCAAOIC 07BD 

d7Bl CCT<;C<;CAli{ frACCACCCn ArcraCTSCr GCUCfrGfiCTQ cccgtcgacc ACAOCCAGCT W^iO 

Wi CCDCGAGACG CCGACCrCCA CCCCC^CCCCC CCTGPACCBC AA^CCaHGR T6ATGGAG«C &9CQ 

«991 CAKCrCBCG CTGGCClSfiCC CCCGCCTCOC TCRCACACff fG&TACCAClG AGCtGCBCrc 

8991 GfiGCACGGTC TACCACQACG TGTACCCGI'C GCCCGfiCfiCC CACTACCr^T CCTC^GiACAC 70S(> 

7C21 CrCCOA^ACG CTGCr^GAGT TCCACACCGA CxATSGUrAC CACATCCTCC AQCCGAACTA 7080 

7041 CBTllATfeCrG t^CCTfiCTCCC GCGCGSACCft C^AGAACCCQ GCGGACAC.GC TCGTCCCCTC 71(10 

7141 MTCCCCAAa GWCfGCLtC TGCICCAtGA tAACACCCGG GCCCfiTCTCr TtUMlCWCAA 7203 

7201 GGrccccTGC TGCCTOQACQ TnacCHCCG CGbCGGGfiTC GACCIACCC6G GCfiCSArCGC 7293 

72ei CAACarCAAG CCGCrcrACQ &GQA{GCQAA CaACCCGTTC CTC«CGTAC(; ACCGCGACCT 73S9 

GCTGGCGCCG GAGfiACCCCQ C9GACAA<IGA GfiCCGTCGCC CATCTGrcCC AGGCGCTCGA 7^60 

CGATOTBACC GrCGGftGraA ACCTCfiTCCC CCOrSACGrC CTCArCATCC ACAACTfCCG 7440 

7m CACCACGCAC GCSCGGACCC CQTTCTCDCC CCGCrGGGAC fiCCAAQGACC GCTGCCrCCA 7S03 

7S01 CCGCGTC7AC ArCCGCACCO ACCGCAAtQ^ ACAGGTCrCC QCCftGCQLAo: GCCCfiffGCGA 7560 
End ef ORF 5--» 

7(61 CACCATCTCff TTCTCGCCGC GCCGCTG^Agc ccggctcccc gagciccctg? 9cccc99«9« 7^20 



" J:.. 



•V' 



7921 CQQoaccggc tccc99tcet gcceeetcdc cegccgcg^g g9^9«999g9 cog9C£.cclt 7SB0 ^t'jf-\ 

769} t9l9CCQ99t gccgtgc^tc c 19791x9^91 £9999^:99 99999ac99C 991199 tgccc 77^0 

77<l) ggc99CC999 tgccstgegc CQCccgtgqg t9ct7i<ai;ag cbdtficgtgt gccgCgegcc 7800 

7601 Qcpcc9^9ca loaatltgcc Qctctot9g9 oaqlanlgccr 9a9tgc9gc9 ggtga^geeg 7^80 

Beainm'nq of OXF 6--> 

7881 U9cc9(9«e etticcgtgq ee99q9ocq€: tgacATbTCC CACAGCACAC CGAACACGCC 7920 

7«ai acGCCAnc GrG^rccAcA CGCCGccccr cGGcnnGcc gac(Jaccgci: cccACGAcn ?9ao 

I 

7641 CACCCrCCTC GCCTLCAtCG CCCCGftC<AC COrGAWGCC CTCT7CAWC GCtCtCGCH ftMO 

8041 CGCCCGCCCU AGCCTCarGC TQTGCCGCGA fiGCGGTG^CC GACG.G&CA«& CGCGCGGTGT 8100 

6101 ^QTGCTCCra GCCCGCAACQ CCAATCTCCC CACCGfiCCTG GAGGCCCACG AGAACCCGCG 8160 
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l^iQUrtt: - G 

8161 COAGfrTfiCOC CACGCCGrCS CCCQGGCCCT CWGCtCCCB GAGfiGWAGA 7GC1GATCSC 9220 

CTCCACCGGfi liTQATCaccc. GtiCACTAClTC OATG&iKCA^C ATCrGGCAliC AlltKAAOAC SSaO 

£281 cCTG^AaTGfe CCCCCCCGC-G ACCCCCGCTT C&ACCCCBCG GCCCCC&CCA TCtlTCACCAC i3%0 

OaiH tGACACCCCt CCCAAfiPAGG TCCBCBTCAQ CSTtBGCGGff C-COACCfTCC fGUfiDATttt 9*100 

atOI CAAGGGCGTC GJfCATl^ClGC /iGCCCCACAT GGCGACGC7G CTSADCFTCT rCQCCACIiaA 13480 

BHSi CGCCCEGCrG GACCCGGCC^ AGCA&CACCC CCTCfTCCGC CGBGi'CATGG ACCOC^CCTT 8620 

652 1 C/iACGC&Q7C AGCAKGAPA CCfiACACCTC CACCAGCfrAC ^.CCCCGGTGC IbFTCGCCAA aSfld 

aSei CCGCCTdG-CG QGCGAfrCTM AC^CfiGftGCA CTTCCAGCAG ftCGCTfiCACA CGfiCCCCGCT a64D 

dQII GBCCCT^GTC AA^GACATCC CGACtCACGG P<XASGGCBCG GCCAACCTGA TCGAGGtCCA 3700 

&70\ &GTCACCGGC CCCCCCGACG ACGCCCAfiBC CAAGCGfiCrC QGCiUGACr.G ICUTCAAGTC 3760 

8761 CCCarTGfiTft ftAGACCfiCCG 7CCACCGCTG CCACCCGAikC TGSGGCCMG TC^CCATGaC 8320 

Best GATCOaCAAG TQCTCGGACG ADACCGACAT CGACCAGOAC CGGGTfiACGA TCCGCnCCG «a80' 

StB\ LGAGGTCGAC GTCTATCCGC CQAAOGCCCG GaGCQ'^CCAG GCCftACCACG CGCZfiCSGCG SMO 

fl9q1 CftCCarCGCO GAdCATCrEC WfiCtGACtiA GCT^CrCATC CQQATCGAtC 1 CGCCArCGC fiWO 

9001 QQACGQ&GCC TTCACCGtCt ACGGCTCCGA CCTCACCGAI^ GQCTATGTGC G(^CTGAACTC «0S0 
EM of QRF a—* 

9061 QQAGTACACC ACCTf'^tece e^aaccrgs^o aci$qgccgzc gccccgtl&e ctg-tcc^ctc &120 

9121 cc^tcec^Xg tggttotDCC gaccgttccc cggctatgc^ cac^ggoc^g cigc^Qcce^sc 9180 

9181 acc^^Eicccc gccc^g&e^fr adgat^er^^c) gcgatgcaog g'tqacgaggQ frdggagggcic 9^40 

BMl-nnlng of D8F 7"-> 

^2^1 Ar?GAGACCA CTCGCTCGAC GACCGCQQAC aASGGCTTCG ACGCCGGG&T ACCQGOAGrG 9300 

9301 Q7CGCGC£GA CCGACGCCCC C&GCGC6ACG CTGCGCCTCG 7CCBCACGftA CGACTTCGAC 9380 

9361 TCCCrCQACr CCC6CAACAC crACTACGCC TAUCCTC&A ACTTCCTCCG GCTCATCQEIC 9120 

942 r CG5ACGCrGK TCACCTTCtGA CACCCC£CCG CCCAACGCCG GCGAGCCGLT CCTGCCCGAC 04B0 

9tai CTCGCCGAGT C4CTCCCCBA GKCTCCSAG GACGGCCGCG TCTCCA^CfA CCGGCTGCGO 95<!0 

954(( (SAGGGCCTGC G<: TACGAGGA CGCCACCCCG GTCGTCfCSG CtGACATCAA GCACBCCaTC SQOQ 

9001 GCCCGCAQCA AtTACGGCrtC tGATGTCCTC GGCGCCBGrC CGA-CCTACTT «CCCACC fC 0090 

Gddl CTG&GCACCG AGTACCGlCGA CCCCrCGCGC GAGCCGGAC^ CCCACQGACC GfiTCACGCTC 9720 

0721 ClACACCCCGC AcGAGCG&AC GCrCGTCTTC CGGCTGCGGC AGCCGTTCr;-C CGGI^ArGGAT 9750- 

9781 CTGCTCCCttA CCATfiCCGTC CACCACCCCt GTfiCCCCttCG ACCCC5ACAC CGCCGCCGAG 9840 
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2108113 



VLOUHli 2 - V 

9&1I TACCGfiCrCC &GCrCGTCCL CACCfiGCLCG r^CCCCATCfi- TCTC&fJiCAt CCGGCttCQAtl 8dC0 

9901 CTQBCCGTCC fGCWtCCAA rCCGCAC7CC tACCCCGACA CCGACCCRdT GCGCGTtCAC Meg 

9«ei CSCG^CTCCC MATCGAC<GT nCACCTCHGC AAGSAlCCGC ;iCGAGQTGCA CC&CATOCTO tC020 

10021 CTCGcaccc(7 Ar^GcccATei c«jiccTccc£ c^cncoGTi; 7GCASCCCGC cdcccAGGAB looaa 

IQOSI CGUrCCTCG CCGAfiCCGQA GCTGCCCeCB CAClSCGGACA ACCCGCTCAC CCPC7TCACC tOHO 

rOMt TCCATCTACr GCCTerWAft CWGAFCCCC CCCTTCGACA ATtTKACTG CCGfiCfiftGCC 10200 

10201 HTGUfiTTCG CCACC&ACAA JkGCCecCATG CAgG/^CG^GT ACGffCGGC^C CCTCCGCSCC )OSBO 

10261 GACATC6CCA CCACCCTGCf GCCCCCWCC CTCCACCCCr ACAACCACn CWCCffCTAC 10320 

10321 CCOOrCCQCC CCMCCQCAC CGGCCACCT<} GftGGCCQCtC GC&CCGACCt GAJK^CTGGCC 10230 

m&\ fiGGATtiCCCG ACGGCTTCCG CACCABGATC tiCCGCCCGCA AGGACCGGC1 CAAIiQABfAC 10<R0 

)04<j1 CGiCftCCGtCG A«GCGC7aQC CUCCQGGCTC GCCCG^GTCG QCATCGAGtiC fiUAGCTQCT« 10500 

10501 GACnCCCGT CGGGCGACTA CTTC&ACCGC TACfilGCGGCr tiCCCi;GACTh tcTGi^GGGAG lOSSQ 

10561 CACG4GATCG MATCATCAT CTtCGGcTGl GGCGCCUtT TCCCCftAtGG ArACGGCTTC 10820 

10&21 liTCCAftCAQA fCACCGAC&G GCGCGCGATt AAGGAGCUC iCAACDAGAA CAjCfiGCCAG (CdSO 

|06B» CTQOACGACC C6CACJ)TCAA CQCGCrSCTG GACGAGCGGG CCCAGT6CCC CG&CCCSQCG 107110 

107f|1 C66CGCSCQQ ASATCTGGtA CCGCATCGAC CAGCTCACSA tGOACCACGC GSrcAfCGTi lOQOft 

lOaOl CCaTAWTBT *-CCCGtCUTC CCTeCTC7ftC CftGCACCCGG ACACCCtCAA CCCtlFCCTC 10960 

10B61 ACCG^CrCCT 7CGGCA7tnA C&ACUCQTQ GCGCTCGCCC CGAAQtGAgc ac999gUc9 10930 

10921 9ccc-c999ac cgtoit^tccc ^^ggccggac cccg««cgU cecc^cccQg tcc^$tc«gg 109BQ 

t0»Bt OCCC99UGO g9COC9c7CA GCCGGACA7C CCGGCCCCGG CCGC^CCCC GCftCC«GATC 11040 

110*1 f GCCCWT6GC CCTCMCCAG WGCCGTrCC ACGCTCCCCC A5GWAGAGC GCK7CGC6S IJlOO 

(1(01 AACrCCOCCr CC7ACAGCCt fiA4CTGfiCGC AOGAACTGCC GG07CCGGCC; QCrUGftCrG IHfiO 

^1191 GTCtCCCGCG GGCTGCGCAB CAGGAGCCGG GCGCCQACiGG ACTgCTCCAG CGGGYgAATC f1220 

11221 CGQLCGGQTGA CCGCCQACTG BCTGATLGAC AGCACCCCCC CCGCCCGGn GATGCTCCCft 11SB0 
1l2ft1 TGCCfiQGCCA CGfiCGTACAG CAGA7CCAGA TCftTCCACAT CCACTrrCCG CCCCTCGfiCC 11340 

1)341 TGQGCCGGCA 1:GCA0CCCTG GTCGG^TCCC CCCCCOAAGC GGCGCaCCfC CdCGCCGQTCi II^IGO 
MIOl CGC7CCCCGr ACCAUTGCGC CCACCAGGCC TCBTCUGCA GG7CGCG&Te GTCTTrGCCG U^QO 
1148) AAQCGCCGGA GCT«GACCTC aGCQAfCAGC &CGGCCAGCC GTCCCftCDAG CCtCCCGftGCC 1 1&20 





: .*• * y 



. .'Si 
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1153> iCGATeeTBC GCTCGACOA^ CASACrCQTe ftfCC^QCCCi; GqCGCTCCCC CACGGAfiCCC MSSD 

VISaV CCCACCACCG AGfiGCTCCTfr CACCCCCGtG TCWriGCrt AGCCEAGACC TATtGCCTCC IWO 

CCaCGGCGCJl GGAJfiCCtCC BGDiACCfSAT CCtCCCGTGA 7C7CGACCCG GGTGGCCGDG MTOO 

11701 G7GAffCCCG5 CCAGCrCCAA GACACCrGTC ACCA.GSAIC7 CtGACCCUCC 7CCCCTC1CG ^750 

11701 aACACCCAR* TCtCSrcCCE CAEiArCBGCG AGCIJA&ACCT CCCCCC-GGiQC ffGCCAGCGGA 11820 

lUai TGCr^CCGfi* tCAG&AlCAC CCACAfrCCG* TCGTCrAGCA CCTCACAGCT CCGCACG«AC liaftO 

ItBSi CQCrCCWGC rGTGCtCtidG GCACrfiGAGtf CTCCAGOTGT ACCCCCCGTC CACCTtClAG n^^0 \ * - . 

ngflif CCCGCCAtttt SGGCaCCfiAC CTGCTCC^GG GCCrCGfCCC CriACr.(iACAfi CAGCAfcGTCC 12000 '. - ' 

I2O01 AGCCAGfiCCB CCGC&TCCTC CfcCtftCCTCG TWACCACGft GTrCCCTHGA CACCAGCCAC I206Q . 

laoe^ ftGCACCTCCG GCGCCiTCCfcC. QGCCTrCGAD CCATUGCCGA AflATATCWT CCGC^JCBGCC 12120 - \ 

I2(2r A(JG7CGACCT GliTGGAlGAA CCCCCGCCCG tCCACCACCA TCCCaOAGDC CGtGGTGCTC 12130 .'. 

12^31 AGCtGCCCCG TCTMCQCCT aCGCAGMTC AGCGCBAGfcC CGACfiATCtC rJCCASCCaS 12240 

122*11 TCGAGrCTBC frCTCCACMf CCCGrCCCGG ACACCCGTCC GtCGfiC-CCAC fTCCATffqgq 12300 ~ ; y ' 

12301 tft-tccflcogi gtcceaccgc gtceo^locua gdeagflrtCQC dtcMctgoc acca9cqg<ic 12CW0 

12361 9tcg9ttctg acce^a^oKja caatgtcgQt tccctttlcc ^icociggncl.'ataccQCtga 12^20 -.-Li 

12121 attgtccgao ^t^^ctcttg oqt^gc^icg qtiatcqat^Q taggcogcgc cg-ctct Lcgq \Z^QO , \ 

(218 [ atlctcetes eegggoagcQ gaacgcgccc ggcc-99at9g cgg^cgc^ct ccgggcgcc^ 12640 

12$(^l tccc^QQaac gggggocgg? gccrcQrjcocQ accagccacc cggiccgggc ficgcg^cq^'a 12600 -■ . *• 

*--end of ORF » . " " • • 

I2WI gacctggteg gcggacgw^ arCAGACCFG «TC<1GTGCCG COTATG/AGA rCTCGTGCAC 12660 - , - : 

\2W\ DGrCGCGr^G r«C<<GCeiCGG TCAC&GCCTA GCGftACCGCC TCCGCGATGT CCTCG6CCTQ 12720 

ia72i UGCriGCCC ATCTBOCTQA TCCQCTGCrC CTACATCTCC trCGTGGCGC IBTWCtGAV 12790 •* " 

J27ai CTGGCCGCGC; AGCTCCGTQT CGCtTGHrCCC CGGCTCCATG AC&AC?ACCC GCACCrCGCG 12640 [ 

i2Ati\ CTCQGTaaCC ttCTCGCCCA CCftTCTCCCT CAAWCOTfC ACACCWACT TCCTGQCC7G 12900 

ia«0] BTACACGGCC CCOrTGCSGA- CGlTCADCCG GCCCGCBAfC ftAGfiACATCT GtACCACG^T I29W 

12961 OCCCTraCTC CBCAGCA«AT CG6GAAG4GC CCCCCQRCITC A^GTACAfCA CGCCCACGAG 13020 

13021 AT7«Gr(J7CG ATCATCWGS rctAGTCfiBf GBTGTCGGCft rCC7CCACCG GGtWACCAC 130BO 

laoai CATSATCCCC GCCrT^TTGA CCAGGAT^TC OAOGCCGCCC AGC^CCfCCA CGC7CCAGGC 13140 

UlJtl GACGCMCCG ICCACtCtCF GCCCGrCGGC CACGTCRACT TCGAGGACAf «GACC7TCGC 13200 
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(3201 CCCanCGfiCG DfCAGCTCGl CACCCW/KjC GCGtAGCTTC TCCMCCaCiC DCBCECCGAT 13e«U 

)32Qt (GCCAC&GCG riCGCCCfCGC {6eCCi\&(»C fiCfiCeccGTQ CCCTCGCCCA TCCCCGACCr 

\---begiruilfKj flf OIV -9 
13321 CGCGCCCGTG ATGACCGCCA C-TTTCCCCTG CGCATcoltl ctlccaeatq 18350 

13^ai ^t^ctg^^at <;9'tQ9t9a9c qtal9°<^oflr qqqqiqogQ*: cttjee^tgcc ^g^QCgcigtt 

13141 ccgtacgcca gqcccjt t^^g gtg^gcgcq^ rcgue<i^9gt acqgot99cc c^qgtLcccc )'36CO 

IdSOl ggggagltcc cggggao^gg tgaalaccgc g^c^ctctcc gaCg^fcCtc ggaggacocc M£60 

13fid1 cg^g^ottca ccg9go<i^cq gcggec^^ag t-tctci;ccgt ccocggcdgci c^clQ-tcagc U620 

(3821 9<c9caltcc ccq<}^9(»l>^ cCvMcg^t^ go^c^ggt la Cgact9i't<c CQCcgggUa 13630 

tgcgcqecgc ece-ggncigac cgQccocccg ccc09g9ac-t gcggcagatf g99C9CC0cg I97RO 

Boginning oi 6RF 10 — » 

137^1 D<tdt4g<:g£g ogcaocgo^c ggcg^t^^Ar CATChACCMl CCAGCCCCTC AGTCCCACCA tSdCO . 

U901 G^TGdCACtG CC&TArcCOA TGtACC^fifiT CTGCCCGCTC GACCCGCCGC MtAACfCCi Uaca 'V--' * 

Uaei CGGGC1GCC9 TCCCAGAACG MGCGAGCCG CiGrGACGCTG 7&&GACQGCA QCCAOfftGrti 13de& ^ ci"' 

1392 r OCrGGTQACC rCGCACGCCG Q&GCCCBfiGC CfiTCCT^&C OACCGCCeC^ TCACCCOGBT f^SBO 

UCtUdC&CG CCCCGCrTCC CGArCClCAC CCGCACCTCC CAACTCGTCC GCQCCAACCC 14010 

MOfll GQAftTWGCe TCGnCATCC BC4T«AC6A CCC&CAGCAC rccCGGCTtiC BCfCCATCCr 14100 

14101 CACCCGSGAC TfCCTIiGCCC QCCGCGCCGA GGCGCTGC6C CCGGCGGTGC GCGAGCTCCr H1«D 

14161 C&ACftAGArC CVQGUCftCCC TftCTGAAGGC CaAaCQ9CCCl GrCQ&CCrGG TCGCCGOACT 1^220 

tf221 GACOATWCt MtCtCTCGC OGGTCAICXC CtTGCTCTrC QGCCCCGGrc ACftACCfiCM (4200 

l««l liCAGnCATC GAQCACCacA GCGCGCTCCr CATCPACCGC CGCTACACtC CMWCAGGT 14310 

iq34t CGCCAA8QCC CGGGACGAAC TCOACGGCTA TCTGCGGfi&fi CTGCTCCAGG AGCG&ArCQA H^OO 

\Wi MACCCCCCC ACCGACCTGA TCAGCCOGCT CGrCATCCAC CAGGTGCCQC C^QGCCATCf 1449Q 

14481 GCGGGTdGAG MGtAlRGrcC CGATGTCCCG OCTCCTGCTG GTGGtCGCTC ACGGCACUt 

14521 CACCAQCCAG GCGACCCFCA GCCTGCtCAG CCtfiCfCACC GACCC2GAGC -TCGCCCGGCQ KSdO 

14Sel CCTCACCGAC CACCCGGCCC TfrCTGCCCXA OGCCSTCflAC GAGCTCCm WirCCACTC 14640 

14641 CATCfiTCCAO AACCCGCTfiG CCCCTCCCGC GGTCMfiCAC CTCCAGC7CG ACCATGTOCT \4703 

14701 CATCCaGCC4 GGCOAGBdCG iBGTGCTfiTC CCTQTCQaCG Gt^CAACCGGG ACCAMCGGT 1476(0 

14701 crrccccsAC ccgcaccgbg tbgac&tgga cccccAcacc cqziquak rcGccTrcsG hbso 

K821 CCACG&CATG CACCAGT^C TGGQCCAGTC CCr<^GCCCGG CTCGAGCTG^ AGGAGATCCr 14800 
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210S113 

HBdl CGCCDCG&rC C7GCI«crt(;A Tt^CCCQGTQC CCGGCTC&CC GTGCCClTci: AGGAfiCTl^CA Wdt^O 

end of ORF ^0"> 

I45«l CirCCGTCAT CAGGTfiTCCA iJnACGGCCT CCGCGCCCTC CCQBTGACCT eGTGA^cgQC 150CO 
IGOOl gtggagcggc t^accgtcci't cc1.cc>acgcg ^cggcclgcl yc^cgot^gg qc^ctgcqc^ 15C360 

I to I 20 I 3a I io I 50 1 «(7 
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V 



Ba>9A 



11.6 kb 



/A 



pctAB pobC, 



XT 1 1- 



7^ 



EcoRl 




2kb 
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/ / n — 1 I L 



11.6 kb 

I 1 . J 



wool/ -K^i^WtWl 



.* / .'** 



2kb 
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em 
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, eo 

S. CI. CLA v*ridahvaprYCiq^lptJ=iiALPhrfpQPrgyDV--VvlGaPvOq9TS/RpCA«f^ 

ARC HU— " -aKpkpUUGAPrSXGQPaCtVFJCGPDaLliKAftL —ye 
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